Rank in Wordlist | Frequency | Word |
---|---|---|
6958 | 7 | 1,000 |
6959 | 7 | 10,000 |
6973 | 7 | 5,000 |
7606 | 6 | 2,000 |
7609 | 6 | 300,000 |
7611 | 6 | 4,000 |
8517 | 5 | 25,000 |
8524 | 5 | 6,000 |
9689 | 4 | %, |
9701 | 4 | 150,000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
8270 | 6 | off)(Nevermind |
14580 | 2 | 32(0)2 |
14582 | 2 | 33(0)3 |
14592 | 2 | 4(2)(b |
14665 | 2 | A(H1N1 |
16019 | 2 | Intel(R |
16115 | 2 | Keyword(s |
17680 | 2 | author(s |
21098 | 1 | 11(d |
21099 | 1 | 11,2002(Low |
Rank in Wordlist | Frequency | Word |
---|---|---|
8270 | 6 | off)(Nevermind |
8508 | 5 | %) |
11475 | 3 | %). |
14438 | 2 | %), |
14580 | 2 | 32(0)2 |
14582 | 2 | 33(0)3 |
14592 | 2 | 4(2)(b |
21402 | 1 | 17.2.1997)Eurobridgearchetypes |
21596 | 1 | 2(b)(i |
21665 | 1 | 2001)264 |
Rank in Wordlist | Frequency | Word |
---|---|---|
2988 | 23 | 50% |
3726 | 17 | 20% |
4324 | 14 | 40% |
4326 | 14 | 80% |
4577 | 13 | 10% |
4578 | 13 | 100% |
4847 | 12 | 1% |
5503 | 10 | 3% |
5504 | 10 | 30% |
5507 | 10 | 5% |
Rank in Wordlist | Frequency | Word |
---|---|---|
1324 | 58 | R&D |
7835 | 6 | Q&A |
7853 | 6 | S&D |
10277 | 4 | S&T |
11630 | 3 | B&H |
14862 | 2 | B&Q |
15427 | 2 | ECA&D |
16747 | 2 | R&I |
16748 | 2 | R&T |
22411 | 1 | 7&8 |
Rank in Wordlist | Frequency | Word |
---|---|---|
8507 | 5 | $1 |
10366 | 4 | US$ |
14436 | 2 | $25.60 |
14437 | 2 | $700 |
20826 | 1 | $1,000,000 |
20827 | 1 | $1,35 |
20828 | 1 | $1.4 |
20829 | 1 | $1.8 |
20830 | 1 | $10 |
20831 | 1 | $1000s |
Rank in Wordlist | Frequency | Word |
---|---|---|
573 | 126 | it's |
1117 | 68 | It's |
1125 | 68 | don't |
1137 | 68 | they're |
1394 | 56 | that's |
1422 | 54 | That's |
1521 | 50 | I'm |
1918 | 39 | Europe's |
1967 | 39 | you're |
2220 | 33 | EU's |
Rank in Wordlist | Frequency | Word |
---|---|---|
1658 | 46 | and/or |
6979 | 7 | 9/11 |
9617 | 5 | to/from |
9728 | 4 | 24/7 |
9741 | 4 | 874/2004 |
10014 | 4 | GUE/NGL |
10029 | 4 | Greens/EFA |
10868 | 4 | his/her |
12178 | 3 | Left/Nordic |
12588 | 3 | TCP/IP |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots